Automating Document Review

Authors: not recorded
Abstract

Law firms engaged in litigation expend significant time and resources on document review, a process requiring brief examination of thousands to hundreds of thousands of client documents. Often performed by first-year associates, the document review task is essentially a classification problem: documents need to be sorted into a number of categories, based on their content, their author(s), and a variety of other features. Due to the high volume of documents to be examined, and the often simple rules needed to classify documents, this process is ripe for assistance by a natural language processing system. This paper explores the desired features of an NLP document review system and demonstrates some results using a prototype.

1 Discovery and Document Review

In large litigation cases or government investigations, law firms expend significant effort in the discovery process, during which litigants must produce internal documents relevant to the matter in question, and make them available either to the opposing counsel or in response to a government subpoena. The documents in question may include memoranda, financial statements, internal work papers, and a great deal of email. The widespread use of email by employees of large corporations has significantly increased the volume of documents that need to be reviewed as part of the discovery process. For the purposes of this exploration, we will focus on the task of classifying emails; many of the other documents in question are included as attachments to emails, and references to these documents in the email body may be used for classifying both types of documents.
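To make the classification framing concrete, the following is a minimal sketch of sorting emails into categories with simple rules over author, body text, and attachments. It is not the paper's prototype: the `Email` fields, the category labels, the `.example` addresses, and the rules themselves are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Email:
    # Hypothetical minimal representation of an email under review.
    author: str
    body: str
    attachments: List[str] = field(default_factory=list)


# Each rule pairs an illustrative category label with a predicate over an email.
Rule = Tuple[str, Callable[[Email], bool]]

RULES: List[Rule] = [
    # Assumed rule: mail written by outside counsel is flagged as privileged.
    ("privileged", lambda e: e.author.lower().endswith("@lawfirm.example")),
    # Assumed rule: a body that mentions a financial statement is financial.
    ("financial", lambda e: "financial statement" in e.body.lower()),
    # Emails with attachments are flagged so the attachments can be reviewed too.
    ("has_attachment", lambda e: len(e.attachments) > 0),
]


def classify(email: Email, rules: List[Rule] = RULES) -> List[str]:
    """Return every category whose rule matches, or ['unclassified'] if none do."""
    labels = [label for label, matches in rules if matches(email)]
    return labels or ["unclassified"]


msg = Email(
    author="cfo@client.example",
    body="Attached is the Q3 financial statement.",
    attachments=["q3.xls"],
)
print(classify(msg))  # ['financial', 'has_attachment']
```

A rule list of this kind keeps each decision inspectable by a reviewing attorney. A statistical classifier could replace or supplement the predicates without changing the `classify` interface.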

Similar resources

Internet Engineering Task Force (IETF) Automating DNSSEC Delegation Trust Maintenance

This document is subject to BCP 78 and the IETF Trust’s Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text...

A Method for Automating Text Markup

Markup languages based on XML are increasingly popular, and languages for other formats such as RDF are under active development. One of the problems involved in converting legacy documents to use XML or other markup formats is the insertion of tags into the document and the consequent rearrangement of text required when markup is added to an existing, un-marked-up document. This paper describe...

Requirements Generation System

The Requirement Generation System (RGS) is a computer supported cooperative work (CSCW) tool that provides an interactive processing environment to define, control and structure mission requirements. RGS reduces the time and cost of developing requirements by automating many of the activities associated with the development, editing, review, approval and creation of requirements documents. User...

A Formalism of XML Restructuring Operations

We present a set of primitive restructuring operators that, when combined, are sufficiently powerful to convert an XML document under a source schema into an XML document under an arbitrary target schema. We initially define the operators at the schema level, and then show how each operator induces a corresponding transformation on any XML document under the schema. Finally, we note that our op...

A State-of-the-art Review on Multimodal Video Indexing

Efficient and effective handling of video documents depends on the availability of indexes. Manual indexing is unfeasible for large video collections. Effective indexing requires a multimodal approach in which either the most appropriate modality is selected or the different modalities are used in collaborative fashion. In this paper we focus on the similarities and differences between the moda...

Publication date: 2006